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X> . 1 Motivation: 

a" 

Most of the statistical tests currently used to detect differentially ex- 
pressed genes are based on asymptotic results, and perform poorly for low 
expression tags. Another problem is the common use of a single canonical 
cutoff for the significance level (p-value) of all the tags, without taking 
into consideration the type If error and the highly variable character of 
' the sample size of the tags. 

cn 

\6 ■ 2 Results: 

O . 

This work reports the development of two significance tests for the com- 
parison of digital expression profiles, based on frequentist and Bayesian 
points of view, respectively. Both tests are exact, and do not use any 
asymptotic considerations, thus producing more correct results for low fre- 
' quency tags than the \ 2 test. The frequentist test uses a tag-customized 

critical level which minimizes a linear combination of type I and type II 
errors. A comparison of the Bayesian and the frequentist tests revealed 
that they are linked by a Beta distribution function. These tests can 
be used alone or in conjunction, and represent an improvement over the 
currently available methods for comparing digital profiles. 



3 Availability: 

Implementations of both tests are available under the GNU General Public 
License at http://code.google.eom/p/kempbasu 
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5 Introduction 

Crucial events in the biology of living organisms, such as cell differentiation and 
specialization, depend on minute variations of gene expression under different 
conditions and/or temporal events. A key approach to elucidate a gene function 
is to quantify and compare the expression level of a large set of genes in differ- 
ent tissues or developmental stages, or under different conditions/treatments. 
This task can be performed using large-scale hybridization to microarrays, or 
by counting gene tags or signatures us ing methods such as Se rial Analysis of 
Gene Expression (SAGE), developed by I Velculescu et ~cdl (Il995|). an d Mass ively 
Parallel Signature Sequencing (MPSS), described bv lBrenner et all (|2000f h By 
comparing transcript expression profiles among different samples, one can iden- 
tify differentially expressed genes associated with a particular tissue and/or 
condition. Unlike microarrays analysis, SAGE and MPSS do not require any 
prior knowledge of the transcript sequences. These techniques provide a digital 
profiling, and permit to estimate the relative abundance of mRNA molecules 
of a transcriptome based upon two main premises. First, these methods as- 
sume that each position-specific short sequence tag can unequivocally identify 
its corresponding transcript. Second, that the tag counts are representative 
of the abundances of the corresponding mRNAs of the transcriptome, that is, 
every mRNA copy has the same chance of being counted as the correspond- 
ing tag of the library. The selection of a specific tag sequence from the total 
pool of transcripts ca n be well approximated as a sampling with replacement 
( Stollberg et aLl . l2000h . 



Several statistical tests have been devised to deal with the problem of com- 
paring digital expression p r ofiles and ident i fying differentially expressed genes 



paring digital expression pronics and ldentiiymg differentially expressed genes 
(lAudic and ClaverieL Il997t iBaggerlv et all |2004 [Robinson and Smvthl. 12007 



Stekel et a/.l. l2000l: iThygesen and ZwindermanL |2006[ IVencio et ali\2004, an d 
reviewed bv lMan et all (|2000l) iRomualdi et all <|200lb and lRuiiter et all (|2002l ). 

Most of these tests, including the x 2 test as the most classical representative, 
rely on asymptotic methods and, as such, perform poorly when the sample size 
is small, as is the case of low expression tags. Another problematic aspect of 
detecting differentially expressed genes by significance tests, is the use of a sin- 
gle critical level, such as the canonical values 0.1, 0.05 and 0.01, which, apart 
from the common use, do no present any particular advantage. In this work we 
report the development of two distinct exact significance tests for comparative 
studies of digital expression profiles. The methods are based on the frequen- 
tist and Bayesian points of view, respectively, and are fully implemented on 
open source programs. Since these significance tests do not use any asymptotic 
considerations, they produce more correct results for low frequency tags than 
asymptotic methods. Also, the frcqucntist test uses a tag-customized critical 
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level which minimizes a linear combination of type I and type II errors. We 
provide evidences that both methods present very similar results and represent 
an improvement over the currently available statistical tests. 



6 Methods 



6.1 Prequentist Significance Test: p-value 



As a first approach to address the problem of detecting differentially expressed 
tags, we propose an exact and novel frequentist significance test. Assuming k 
(> I) libraries, let M be the number of distinct tags and Nj the total number 
of tags in library j (j — 1, 2, • • • , k). The frequency of the i-th tag in the j-th 



library is denoted by Xy. Hence, 
statistical model can be stated as: 



X 



X 



2j 



X 



M 3 - 



The basic 



• For each j, the random vector X.j 

as a multinomial with parameters Nj and Pj 



(Xij ; X 2 j ; 



Xnij ) is distributed 
(pi 3 ;P2j-,- ■ ■ ;pmj)- 



• The random vectors X.i, X.2, • • • , X.& are mutually statistically indepen- 
dent, that is, we assume that the libraries have been collected indepen- 
dently. 

• Considering that, for digital expression profiles, parameters P ., and Nj 
usually assume l ow and high values, res pectively, several authors ( Audic and Claveriel 



1997t ICai et all 12004 IZhu et all 120081 ) have considered that the distribu- 
tion of tag frequency Xij can be approximated to a Poisson with mean 
NjPij. 

• Assuming that the libraries have been collected independently, the full 
model for a single tag i is 



Pr {X tl = x a , ■ ■ ■ , X lk = x ik \p a ,p ik } = 

1 — , ^ , fc ' exp(Nxpn + h NkPik)' 1 



6.1.1 Partial Likelihood 

One can now write the equation fl} using an alternative parametrization. Let 
the new parameters be 9i = Nipn + • ■ • + N k p ik and IT = (tTji! ' ' ' j n ik)j with 
iTij = NjPij/0i. Assuming the random variable Yi = Xn + • ■ ■ + X ik and the 
observation yi = xn + ■ • • + Xi k , the two following events are equivalent: 



{Aii = xn , ■ • • , Xik = Xik } 
{Aii = xn, ■ • • , Ai fe = x ik ;Yi = y,} . 
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Therefore, the alternative statistical model can be written as: 



Pr {Xn = x a , ■ ■ ■ , X lk = x ik , Yi = y;|IT, 6>;} 



w 1 9 y e~ y 
'" M*** (2) 



Xi\ \ • • • Xik • 



With this new parametrization, the full likelihood becomes the product of a 
multinomial probability function by a Poisson probability function. It is note- 
worthy that the new parameters, IT and 9i, are variation inde pendent, i.e., their 
valu e s carr y no information about each other. According to iBasul (|1977j ) and 
Coxl (|l975f) . to perform inference about II; (or 6i) one only has to consider as 
the likelihood the multinomial (or Poisson) factor of equation 

Under the full likelihood model, the null hypothesis - tag i has the same 
expression in all libraries - is 



H o : Pi = (Pii, • • • ,Pik) = (Pi, ■ ■ ■ ,Pi)- 



(3) 



With the new parametrization, the null hypothesis is reduced to a simple 
hypothesis as follows: 



Hq : Hi = U — — , • • • , — 



N 



N 



(4) 



This approach of partial likelihood, introduced by I Cox (1975), simplifies 
considerably the problem of comparing the expression of the j-th tag in all k 
libraries. Hence, under the null hypothesis, the likelihood is simply a multino- 
mial probability function evaluated for IIo. Being = {Xn,--- , Xik), the 
distribution under the null and alternative hypotheses are, respectively, 



H' 



and 



Vr{x % .\Y % = m iu = n } = -^-II^T ( 5 ) 



H' 1 :Pr{X i .\Y i =y i ;U i } = y i [l[ 



3 = 1 



(6) 



6.1.2 Significance Level 

According to Cox (1977) and Kempthorne ( 19761 ). a significance test is a method 
that measures the consistency of the data with the null hypothesis. The com- 
monly used index to perform this task is the well known p-value. We refer to 
Kempthorne and Folks! ( 1971 ) for important discussions on the evaluation of 
p-values. For a random vector X, let T(X) = T be a statistic in which small 
values of T cast doubt about Hq. For an observation x with T(x) = t, the 
associated p-value is the probability under Hq of the event {T < t}, that is, 
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p = Pr{T < t\Ho}. The consequence of this definition is that the random vari- 
able T must be a function that produces an order in the sample space. In this 
ordered sample space, the sample points with low order favor the alternative 
hypothesis, whereas those with a higher order support the null hypothesis. 

As we will show below, the likelihood ratio is an appropriate statistic for 
ordering the sample space relative to the null hypothesis. Let i?(X) be the 
maximum of the likelihood function under H$ divided by the overall maximum 
of the likelihood function (eq. [7]), and i?(x) = r be the value of that statistic at 
the observation x, the p-value is Pr{i? < r\Ho}. 

*c*>-(3)"n$r ,7) 

3=1 V 3 7 

As one can conclude, R has the desired property of ordering the sample 
space according to the support of H Q . The use of likelihood ratios f o r com - 
puting p-values has been a lread y addressed by Nevman and Pearson] ( 19281 ). 



Pereira and Wechsler ( 19931 ). and iDempster ( 19971 ). 



Let's define the tail set, Tj, of frequencies more extreme than Xj: 

Ti = {w = (wi, ■ ■ ■ ,w k )\wi H hrat=!/,V R(w) < -R(xj)} . 

The p-value of the tag i, pvi, that provides the significance level for Hq when 
the observation is Xj, is: 



weTi j=i 
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A serious limitation of this exact p-value calculation is that the number 
of points in the sample space grows exponentially in regard to the number of 
dimensions. To overcome this problem, we use an algorithm based on the Monte 
Carlo method: 



p-value(X, N, runs) 

1 i<-i?(X,N) 

2 y^Y,Xi 

3 p^N/EiVi 

4 c^O 

5 for i <— 1 to runs 

6 do W <— Random vector with Multi(y,p) 

7 if-R(W,N)<f 

8 then c *— c + 1 

9 return c/runs 



In order to test and validate this method, we developed Kemp, a C language 
program named after Prof. Oscar Kempthorne, an English statistician who has 
produced an extensive work on the topic of significance tests. 
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6.1.3 Critical significance level 

In order to have inferential meaning, the evaluation of a significance index should 
help one to decide in favor or against a null hypothesis. Hence, a decision rule 
must be stated. For instance, the critical level is the cutoff between reject/accept 
actions. In any digital expression profile, the relative abundance can vary dra- 
matically from tag to tag, implying that the use of a single critical significance 
level for all tags may be unfair for those tags with low frequencies. For this 
reason, we decided to calculate a critical level for each particular tag, accord- 
ing to the recommendations of IPeGrootl (|l986l ). Thus, we used the optimum 



procedure of the decision theory, which minimizes the risk function aa + b/3, a 
linear combination of the two kinds of errors: a and (3, corresponding to errors 
of type I (false positive) and type II (false negative), respectively. 

The value of a is the probability of the critical region using the parameter 
value defined by the null hypothesis. Conversely, computation of (3 is more com- 
plex, since H\ is a composed hypothesis rather than a single point hypothesis. 
To solve the problem of defining the appropriate j3, we considered the average 
of all possible single hypotheses within the set of alternative hypotheses. To 
perform this computation, we used a uniform prior for the parameter of the 
alternative hypothesis, and considered the predictive distribution for this prior 
choice. Fortunately, the predictive is a uniform discrete distribution in the sam- 
ple space and, hence, is a constant equal to the inverse of the number of points 
of the sample space. Therefore, the aforementioned average of j3 is the number 
of points within the acceptance region, divided by this constant. 

To choose the critical level, we considered, for a given value of y, all possible 
critical regions. The critical level is then the value of a for the critical region 
that gives the smallest value of each combination aa + bp. We also defined an 
arbitrary score S (eq. [9J, based on practical results, to order the differentially 
expressed tags according to the relative "distance" between their corresponding 
p-value and critical level. 

*-lo(l-=) (9) 
6.2 Bayesian Significance Test: e-value 

The F BST, Full Bayesian Significance Test, was introduced bv lPereira and Stern 



( 19991 ). The objective of this method is to obtain an alternative index to p- 



values, namely e-values (where e stands for evidence). Both indices vary from 
zero to one, since they represent probabilities on the sample space (p-value) and 
on the parameter space (e-value). The Bayesian test is defined on the original 
parameters Pi, • • • , P/c of the libraries. As a prior for these parameters, we con- 
sidered independent and identical Dirichlct distributions with metaparameters 
A\, ■ ■ ■ , Am- Consequently, the posterior distribution for each indep endent pa- 



rame ter Pj is Dirichlet with metaparameters xij-\-Ai, ■ ■ ■ , Xmj+Am (jAitchisonl . 
120031 ). For this model, the posterior marginal density of the parameter pij is 
Beta(xij + Ai, Nj + S — Xij — A%), with S = A± + ■ ■ ■ + A n . From the indepen- 
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dency of Pj, the joint probability of all libraries, for each single tag, is a product 
of these Beta densities. 

To perform this test, one needs two numerical procedures: optimization 
and integration. The low values of x\j + A4, compared to the high values of 
Nj+S— Xij— Ai, usually lead to an underflow in the numerical integrat ion. Thus , 



we tr ansformed the Beta densities into logistic-normal distributions (|Aitchisonl . 



2003). The mean and the variance of the normal distributions, obtained after 



the logistic transformation, Qj = log[pij/(l — Pij)], are presented in eq. QJJ 
and eq. [Til respectively, where \I/ is the digamma and and is the trigamma 
functions. 

IHj = y(xij + Ai) + ^(Nj -S- x i:j - Ai) (10) 
4 - 9\ Xij + A,) - ^'(Nj -S- Xij - A,) (11) 

Notice that the null hypothesis Cii = ' " = dk is equivalent to the original 
hypothesis pn = ■■■ = p^. By using this approach, the integration of the 
product of Beta distributions is replaced by a integration of normal distributions, 
whose parameter values avoid numerical representation problems that could 
arise when using Beta distributions. 

The Bayesian significance test was implemented on Basu, a C language pro- 
gram named after Prof. Debabrata Basu, an Indian statistician who motivate d 
CABP to create the Full Bayesian Significance Test ( Pereira and Stern . 1999f ). 



7 Results 

7.1 The critical level as a function of y 

In order to establish an automatic procedure to discriminate tags according to 
their differential expression status, we defined two sets of weights for type I and 
type II errors, respectively: (a=l, b=l) and (a=4, b=l). These values were 
arbitrarily defined based on several analyses of SAGE (serial analysis of gene 
expression) data from human libraries, followed by experimental validations 
with real-time PCR (data not shown). The a critical levels were computed for a 
range of possible values of y (the total tag frequency) , and for k values (number 
of libraries) ranging from 2 to 5. Since the calculation of a is a computer 
intensive task, we estimated a polynomial approximation of this critical level 
for each value of k. This result was incorporated on Kemp, our implemented 
software for the frequentist significance test, and is available in the Appendix. 
Figures 1 and 2 present the dilog graphics of the critical level values and the 
corresponding adjusted functions, assuming k values of 2 and 5, respectively. 

As can be seen, the set of weights (a=4,b=l) generates critical level values 
that are consistently lower than those obtained using the set (a=l,b=l). This 
result can be ascribed to weight 4 used for the a error, which leads to a greater 
minimization of type I error. Also, for both sets of weights, when y presents 
high values, the critical level is much more stringent than the canonical values 
0.1, 0.05 and 0.01. 
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Figure 1: Dilog graphic with the simulated values of the critical level and the 
fitted function for k—2. Critical levels calculated with weights (a=l,b=l) are 
indicated by O, and with weights (a=4,b=l) are indicated by +. 



7.2 Comparison between Kemp p-value and x 2 p-value 

The x 2 homogeneity test is widely us ed and desc r ibed in the literature for com - 
parison of digital expression profiles (|Man et 

aD, l2000l : iRomualdi etail l200lh . 



We decided to compare our frequentist significance level (p-value) to the % 2 
test. We used a data set composed of four SAGE libraries derived from hu- 
man brain tissues, and potentially containing genes involved in increased risk of 
Alzheimers Disease (GEO accession code GSE6677). The tags were arbitrarily 
separated into two groups according to their total frequency (y). Thus, we cal- 
culated the significance levels using the Kemp method and the \ 2 test for both, 
the high-expression (y > 50, Fig. [3]) and low expression (y < 50, Fig. 2]) tags. 
In Fig. [31 we can see that there is a good agreement between the significance 
levels when tags present a high expression level. Conversely, Fig. 2] shows that 
when the expression of the tags is relatively low, the levels present a much lower 
agreement. This result is in consonance with what we should expect, since the 
X 2 test is asymptotic, whereas our proposed significance test is exact. 

7.3 Comparison between Kemp p-value and Basu e-value 

Aiming at estimating the consistency of the proposed methods, we compared 
the p-value (the frequentist significance level), and the e-value (the Bayesian 
significance level) using the four SAGE libraries described in the section 17.21 
To obtain the relationship between p-values and e-values we calculated the 
local weighted average of the e-values, using the values of y as weights and a 
p-value intervals of 0.04. This calculation resulted into pairs (p, e), where p 
is the center of the interval, and e the weighted average. We then adjusted 
a Beta distribution function to these pairs. The best fit obtained was a Beta 
with parameters a = 0.66 e b = 1.036, corresponding to a mean of 0.39 and a 
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Figure 2: Dilog graphic with the simulated values of the critical level and the 
fitted functions for k=5. Critical levels calculated with weights (a=l,b=l) are 
indicated by O, and with weights (a=4,b=l) are indicated by +. 

standard deviation of 0.30. Fig. [5] displays a plot of the relationship between 
both significance levels, and the fitted function. We can notice that there is 
a good agreement between the p-value and e-value. Some differences observed 
between their values can be explained by the fact that the p-value is an integral 
in the sample space, whereas the e-value is an integral in the parameter space. 
Despite this difference, it is clear that both significance levels are most of times 
convergent. 



8 Discussion 



This paper introduces two novel methods for the comparison of digital expres- 
sion profiles. The methods, based on frequentist and Bayesian statistics, were 
implemented on the open source programs Kemp and Basu, respectively. Several 
statistical tests have been used to evaluate SAGE data and identify differentially 
expressed tags. Some of these tests have been compared b y different groups 
(|Man et all . l2000l : iRomualdi eTall . l200ll : iRuiiter et al\ . I2002I ) . The general con- 
clusion was tha t the classical x 2 test, originally introduced by Karl Pearson 
(|Pearsonl [19001. was equivalent to and even outperformed other availa ble tests, 
including the Fischer's Exact tes t, the test of lAudic and Claverie (1997) and the 
R statistic of Steke l et al. I (l2000h . The x 2 test has the advantage of being simple 
and can be applied to a broad range of problems. However, given the asymptotic 
character of the x 2 test, it is not recommended for the analysis of low frequency 
tags. Due to this feature, we conceived our frequentist test using the original 
definition of more extreme sample points, without any asymptotic result. As a 
consequence, our test is more correct than the x 2 test for low expression tags. 
Corroborating this fact, a comparison of the p-values calculated by Kemp, and 
the x 2 p-value, showed a good agreement (Fig. [3] and Fig. |4]) only for high ex- 



9 




pression tags. For low frequency tags, when the x 2 test becomes inappropriate, 
X 2 values showed a high disagreement with our calculated p-values (Fig. H}. 
Thus, we believe that the frequentist test proposed here, and implemented on 

Kemp, represents an im provement for the analysis of digital expression profiles. 

So me other methods dBaggerlv etaH l2004t [Robinson and Smvthl . 120071 : iThvgese n and Zwindermanl . 
2006t Vencio et al. . 20oJ ) have been proposed for the comparative analysis of 



SAGE data. However, since these methods are designed for comparing groups 
of libraries, their use is severely restricted in experiments where a single library 
is represented in each category/condition. 

The discrimination between high and low expression tags must be performed 
in such a way as to consider both statistical and biological relevance. Since 
housekeeping genes are expressed in high levels, the absolute number of counts 
may present a considerable variation across distinct libraries. These differences, 
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Figure 5: Relationship between p-value and e-value. O indicate the pairs (p, e) 
and the curve the fitted function. 



nevertheless, are meaningless from a biological standpoint. Conversely, some 
functionally important genes present a relatively low expression, and exert their 
activity by altering tin y amounts o f their expression among the different tis- 
sues and/or conditions ( Wang . 2006h . Therefore, a tag presenting a differential 
expression with low counts would not be considered as significant by methods 
that use fixed critical levels, thus leading to a misinterpretation of the data and 
loss of potentially valuable information. This fact motivated us to calculate the 
critical level of each particular tag taking into account its total frequency. If 
population parameters are not exactly in the null sharp hypothesis set (a set 
with a smaller dimension than the alternative hypothesis set), highly expressed 
tags have smaller p-values than low expression tags. If one fixes the critical level, 
the minimum type I error, the type II error decreases drastically when the sam- 
ple size is increased. Hence, it becomes difficult to accept a null hypothesis for 
large sample sizes, and to reject it for small-sized samples. For example, when 
comparing the expression of two 10,000-tag libraries, a tag presenting counts of 
7 and 21, respectively, would show a p-value of 0.013. Conversely, a tag with 
counts 10 and 30 would result in a p-value of 0.002. Considering a cutoff of 
0.01, the former tag would be considered as equally expressed in both libraries, 
whereas the former tag would be interpreted as being differentially expressed. 
This fact motivated us to calculate the critical level of each particular tag as a 
function of the tag total frequency. The critical region in our method is the one 
that minimizes a linear combination of type I (the critical level a) and type II 
((3) errors. With this tag-customized approach, both tags of the example above 
would be classified as differentially expressed, since their corresponding critical 
levels would be 0.015 and 0.013, respectively. The method is still coherent, since 
tags with very low frequencies, even presenting differential counts, lead to high 
significance levels. For instance, in the aforementioned example, tag counts of 1 
and 3 would result in a p-value of 0.63, a much higher value than the calculated 
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cutoff of 0.03. Concluding, our method judges the tags in a fairer manner, since 
the cutoff value is customized to any particular tag, according to its expression 
level. 

The problem of significance testing of precise (sharp) hypotheses has been 
controversial and both, the frequentist and Bayesian schools of statistical infer- 
ence, have offered solutions. As a counterpart to the frequentist test introduced 
in this work, we decided to also offer an alternative method, based on the previ- 
ously described (jPereira et all 120081 ) Full Bayesian Significance Test (FBST). A 
clear advantage of a Bayesian test, applied to digital expression profiling, is that 
the total frequencies of the tags do not have to be fixed in advance and are, in 
fact, unknown before the observation of the libraries. This Bayesian procedure 
does not require any other assumption in addition to the original multivariate 
Bernoulli observations that produce the libraries. From a Bayesian standpoint, 
it is important to judge hypotheses in their own environment, which is the pa- 
rameter space, not the sample space. With a more pragmatic view on mind, we 
tried to check if the frequentist and Bayesian tests can be related, even though 
they are defined in different spaces. In this direction, we implemented the FBST 
in the Basu program, and compared the e-values to the p-values previously de- 
termined by Kemp (the frequentist test program), using SAGE libraries. To our 
surprise, a strong correspondence between the averages of p-values and e-values 
has been observed (Fig. [5]). This result indicates that both methodologies can 
be reliably used to identify differentially expressed tags. Also, because they 
lead to similar results using totally different approaches, we believe that both 
methods can be used in parallel to validate each other's results. 

Kemp method fully implements a decision procedure, since it provides a 
significance level and a critical level. Conversely, Basu method does not calculate 
the most appropria te cutoff. For t his ta sk, one should follow the decision theory 
steps described by IPereira et aZ| H008), and build a loss function based on a 
good modeling of the risks involved on deciding whether a tag should or not be 
considered as differentially expressed. In this direction, our group is currently 
working on the development of a critical level for the FBST. Alternatively, one 
can use an approximation to determine a cutoff for Basu. Since the p-values 
and e-values are linked by a Beta distribution function, as we have shown for a 
set of SAGE libraries from human brain tissue (Fig. [5]) and some other datasets 
(data not shown), we propose to use the Kemp cutoff, properly adjusted by the 
linking function. 

Concluding, the frequentist and Bayesian significance tests reported in the 
present work, and implemented in standalone open-source programs, extend the 
set of currently available statistical tests for digital expression profiles. Also, we 
believe that they offer some advantages over other reported tests, including a 
more adequate treatment of low expression tags and the automatic calculation 
of a customized critical level. 



12 



System Requirements 



The source code of KempBasu package and a executable binary for MS Windows 
are publicly available at the address http://code.google.eom/p/kempbasu/, 
and are distributed under the GNU General Public License. The code depends 
on glib, GSL and Judy libraries, and if the Pthreads API is available, KempBasu 
can be run using multiple processors. Tested platforms include Linux, MacOSX 
and MS Windows. 
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Appendix 



To establish the function of y that gives the approximate a, we consider the 
pairs {L = Log(y); Log(a)} and use the least squares method piecewisely in 
two difference regions of y values: [1;50] and [51;10,000] . The former region 
adjusts a second degree polynomial 

a k\v) = a kL 2 + hL + c k 
and the latter region adjusts a line 

(2) 

a k (v) = u k + vkL. 

Tables [T] and [2] present the coefficient values for those functions for weights 
(a = 4, b = 1) and (a = 1,6 = 1), respectively . The linear and the quadratic 
functions are combined, for each value of k, into a single continuos function by 
the eq. HU 

(a^\y) para y < 40 

(l-X)a{ 1} (y) + Xa^ ) (y) para 40 < y < 50 (12) 
cijf 1 (y) para y > 50 



k 




h 


Cfe 


Uk 


Vk 


2 


0.00957978 


-0.463118 


-2.76474 


-2.37781 


-0.530119 


3 


-0.304365 


1.18976 


-4.60784 


-0.713611 


-0.968513 


4 


-0.931159 


5.00318 


-10.1863 


0.385118 


-1.28105 


5 


-0.685327 


3.39467 


-7.59502 


1.47602 


-1.57657 


6 


-0.914225 


4.84175 


-9.81444 


1.93518 


-1.70783 



Table 1 : Coefficient values of fitted critical level functions for the minimization 
of 4a + (3 



k 


Ofe 


bk 


Cfe 


u k 


Vk 


2 


0.00748022 


-0.607463 


-0.53588 


-0.629139 


-0.561742 


3 


-0.226299 


0.503742 


-1.7504 


0.677628 


-0.968169 


4 


-0.215143 


0.334093 


-1.38061 


1.79399 


-1.30545 


5 


-0.248689 


0.369967 


-1.13529 


2.62984 


-1.55664 



Table 2: Coefficient values of fitted critical level functions for the minimization 
of a + (3 
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